Neural Network-Based Head Pose Estimation and Multi-view Fusion
نویسندگان
چکیده
In this paper, we present two systems that were used for head pose estimation during the CLEAR06 Evaluation. We participated in two tasks: (1) estimating both pan and tilt orientation on synthetic, high resolution head captures, (2) estimating horizontal head orientation only on real seminar recordings that were captured with multiple cameras from different viewing angles. In both systems, we used a neural network to estimate the persons’ head orientation. In case of seminar recordings, a Bayes filter framework is further used to provide a statistical fusion scheme, integrating every camera view into one joint hypothesis. We achieved a mean error of 12.3◦ on horizontal head orientation estimation, in the monocular, high resolution task. Vertical orientation performed with 12.77◦ mean error. In case of the multi-view seminar recordings, our system could correctly identify head orientation in 34.9% (one of eight classes). If neighbouring classes were allowed, even 72.9% of the frames were correctly classified.
منابع مشابه
A System for Probabilistic Joint 3D Head Tracking and Pose Estimation in Low-Resolution, Multi-view Environments
We present a new system for 3D head tracking and pose estimation in low-resolution, multi-view environments. Our approach consists of a joint particle filter scheme, that combines head shape evaluation with histograms of oriented gradients and pose estimation by means of artificial neural networks. The joint evaluation resolves previous problems of automatic alignment and multi-sensor fusion an...
متن کاملA Unified Framework for Multi-View Multi-Class Object Pose Estimation
One core challenge in object pose estimation is to ensure accurate and robust performance for large numbers of diverse foreground objects amidst complex background clutter. In this work, we present a scalable framework for accurately inferring six Degree-of-Freedom (6-DoF) pose for a large number of object classes from single or multiple views. To learn discriminative pose features, we integrat...
متن کاملEstimating Head Pose with Neural Networks - Results on the Pointing04 ICPR Workshop Evaluation Data
In this paper we report the results of a neural network based approach to head pose esimtation on the evaluation data set provided for the Pointing04 ICPR workshop. In the presented approach, we use neural networks to estimate a person’s horizontal and vertical head orientation from facial images, which automatically were extracted from the provided data set. With our approach, we achieved an a...
متن کاملHead Pose Estimation in Seminar Room Using Multi View Face Detectors
Head pose estimation in low resolution is a challenge problem. Traditional pose estimation algorithms, which assume faces have been well aligned before pose estimation, would face much difficulty in this situation, since face alignment itself does not work well in this low resolution scenario. In this paper, we propose to estimate head pose using viewbased multi-view face detectors directly. Na...
متن کاملEndoSensorFusion: Particle Filtering-Based Multi-sensory Data Fusion with Switching State-Space Model for Endoscopic Capsule Robots using Recurrent Neural Network Kinematics
A reliable, real time multi-sensor fusion functionality is crucial for localization of actively controlled nextgeneration endoscopic capsule robots, as an emerging minimally invasive diagnostic technology for the inspection of gastrointestinal (GI) tract and diagnosis of a wide range of diseases and pathologies. In this study, we propose a novel multi-sensor fusion approach based on switching o...
متن کامل